Picture for Li Wang

Li Wang

Northeast Normal University

ProjQ: Project-and-Quantize for Adapter-Aware LLM Compression

Add code
May 30, 2026
Viaarxiv icon

ZipRL: Adaptive Multi-Turn Context Compression with Hindsight Response Replay

Add code
May 27, 2026
Viaarxiv icon

Learning to Adapt SFT Data for Better Reasoning Generalization

Add code
May 26, 2026
Viaarxiv icon

When Self-Belief Misleads: Active Label Acquisition for Reinforcement Learning with Verifiable Rewards

Add code
May 25, 2026
Viaarxiv icon

DrawMotion: Generating 3D Human Motions by Freehand Drawing

Add code
May 20, 2026
Viaarxiv icon

Implicit Hierarchical GRPO: Decoupling Tool Invocation from Execution for Tool-Integrated Mathematical Reasoning

Add code
May 18, 2026
Viaarxiv icon

Fill the GAP: A Granular Alignment Paradigm for Visual Reasoning in Multimodal Large Language Models

Add code
May 12, 2026
Viaarxiv icon

LINC: Decoupling Local Consequence Scoring from Hidden Matching in Constructive Neural Routing

Add code
May 07, 2026
Viaarxiv icon

MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems

Add code
May 07, 2026
Viaarxiv icon

ATRS: Adaptive Trajectory Re-splitting via a Shared Neural Policy for Parallel Optimization

Add code
Apr 24, 2026
Viaarxiv icon